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Abstract 

Background: lh& rate of human immunodeficiency virus type 1 (HiV-l) infection in Iran has increased dramatically in the 
past few years. While the earliest cases were among hemophiliacs, injection drug users (IDUs) fuel the current epidemic. 
Previous molecular epidemiological analysis found that subtype A was most common among IDUs but more recent studies 
suggest CRF_35AD may be more prevalent now. To gain a better understanding of the molecular epidemiology of HIV-1 
infection in Iran, we analyzed all Iranian HIV sequence data from the Los Alamos National Laboratory. 

Methods: All Iranian HIV sequences from subtyping studies with pol, gag, em and full-length HIV-1 genome sequences 
registered in the HIV databases {www.hiv.lanl.gov) between 2006 and 2013 were downloaded. Phylogenetic trees of each 
region were constructed using Neighbor-Joining (NJ) and Maximum Parsimony methods. 

Results: l\toXa\ of 475 HIV sequences were analyzed. Overall, 78% of sequences were CRF_35AD. By gene region, CRF_35AD 
comprised 83% of HIV-1 poi, 62% of em, 78% of gag, and 90% of full-length genome sequences analyzed. There were 240 
sequences re-categorized as CRF_AD. The proportion of CRF_35AD sequences categorized by the present study is nearly 
double the proportion of what had been reported. 

Conclusions: Phylogenetic analysis indicates HIV-1 subtype CRF_35AD is the predominant circulating strain in Iran. This 
result differed from previous studies that reported subtype A as most prevalent in HIV- infected patients but confirmed 
other studies which reported CRF_35AD as predominant among IDUs. The observed epidemiological connection between 
HIV strains circulating in Iran and Afghanistan may be due to drug trafficking and/or immigration between the two 
countries. This finding suggests the possible origins and transmission dynamics of HIV/AIDS within Iran and provides useful 
information for designing control and intervention strategies. 
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Introduction 

Important characteristics that contribute to the worldwide 
spread of HIV are its enormous genetic variability and rapid 
evolution, wliich makes the virus highly adaptable to selection 
pressures of new hosts. The high error rate of the reverse 



transcriptase which lacks a proofreading mechanism, high rates of 
virus production in vivo, persistent nature of infection and 
selective immune pressure are factors responsible for the high 
genetic variation of HIV. [1] The presence of viral RNA as a 
dimer and co-infection of a cell with more than one viral genotype 
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also results in recombination and mixed genotypes. Together, this 
presents a complex picture of genetic variation of HIV-1 virus. [2] 

The genetic variability of HIV affects pathogenesis, immune 
response and escape, vaccine development, transmission, disease 
progression, drug resistance and treatment response. [3,4] 
Therefore, molecular epidemiology studies are extremely impor- 
tant to characterize the HIV-1 subtype distribution in a specific 
population/region which may significantly influence diagnostic 
and therapeutic strategies. [5] 

HIV-1 has four distinct genetic groups: M, N, O, and P. When 
the genetic groups are presented on a phylogenetic tree, strains 
within group M form well-defined clusters. Nine distinct subtypes 
(A-D, F-H, J, and K) have been identified, along with 61 
circulating recombinant forms (CRFs) which are inter-subtype 
recombinant strains. [6] 

The rate of human immunodeficiency virus typel (HIV-1) 
infection in Iran has increased dramatically in the past few years. 
While the earliest cases were among hemophiliacs, injection drug 
users (IDUs) fuel the current epidemic. According to the CDC, a 
total of 26,556 PLWH had been identified in Iran through June 
20 13. [7-9] The HIV transmission routes in all the cases registered 
since 1986, in order of magnitude, are sharing injection equipment 
among IDUs (68.4%), sexual intercourse (12.3%), blood transfu- 
sion (0.9%), and mother-to-child transmission (1.2%); the route of 
transmission for the remaining 17.2% is unknown. [7] 

In Iran, previous molecular epidemiological analysis of HIV- 
1 gag and env gene segments found that the predominant strain 
circulating among IDUs was subtype A which was related to 
African Ugandan/Kenyan sub-Saharan isolates. [10-13] More 
recent studies oipol, gag and env gene segments reported that the 
predominant strain was CRF_35AD.[6,14— 16] To gain a better 
understanding of the molecular epidemiology of HIV- 1 infection 
in Iran, we analyzed all Iranian HIV sequence data from the Los 
Alamos National Laboratory. 

Methods 

A secondary analysis was performed using all Iranian HIV 
sequences from subtyping studies with pol, gag, env and fuU-length 
HIV-1 genome sequences registered in the HIV databases at the 
Los Alamos National Laboratory (www.hiv.lanl.gov) between 2006 
and 2013. The sequences were downloaded along with reference 
nucleotide sequences for those regions [accession numbers: 
AB703607-AB703616, AY693842-AY693971, DQP77824- 
DQP77851, DQP77854-DQP77871, DQ^l 15645-DQ,1 15707, 
DQ149128-DC)149133, DQ788541-DQ,788560, EU881931, 
FJl 78375, FJl 78376, FJ392730-FJ392755, FJ790242, FJ807629, 



FJ807630, GQ243705-GQ243708, GQ273945-GQ273960, 
GQ274861-GQ274878, GQ853425-GQ853428, GU724804- 
GU724845, HQ233645, HQ,735066, HQ.735068, KF029508- 
KF029592, EF158040- EF158043, GQ477442-GQ477451, 
DQ676872, AB253421, AB253429, AF286238, GU201516, 
AF286237, K03455, AY523387, AY173951, AY331295, 
U52953, U46016, AF067155, AY772699, K03454, AY371157, 



AY253311, 

AJ249238, 

AF084936, 

AF190128, 

AF082394, 

AY271690, 

AFl 19819, 

AF064699, 

AF286230, 

AY093603, 

AF492624, 

AF450096, 

AF286239, 

AY586541, 

AY586545, 

GQ229529, 

AJ670526, 

FM877782, 

DQP85873, 

EF091932, 

DQ.366659, 

EFl 58041, 

FJ2 13781, 

EU735535, 

EU697904, 

FN392876, 

GQ,372987, 



U88824, 
AY371158, 

AF061641, 
AF005496, 
AJ249239, 

AB485636, 
AFl 19820, 
AY535659, 
HM067748, 

AY093607, 
AF408629, 
AF450097, 
EU581825, 
AY894993, 
AY945737, 
AY900571, 
EU693240, 
FM877777, 
DQP85874, 
AY727526, 

DQ.366662 
EF087995, 
FJ2 13782, 
EU735538, 
EU697907, 
FN392877, 
FJ670529, 



AF077336, 
AJ249236, 
U88826, 
FJ711703, 
GQ477441, 

L39106, 
AF076998, 
AB286851, 
AY008715, 



AF005494, 
AJ249237, 

AY6 12637, 

EF614151, 
GU564221 

AF193276, 
AF193253, 
EF368372, 
AJ866553, 



AF289548-AF289550, 
AF408630, AF385936, 



DQ354120, 
EU581827, 
AY588971, 
AF457051, 
AY900572, 
EU697906, 
AJ404325, 
DQP85876, 
AY727527, 
, EF165541, 
EF087994, 
FJ2 13780, 
EU735540, 
EU697909, 
DQ358801, 
HQ385477, 



AF516184, 
EU581828, 
AY588970, 
AF457072, 
AY900574, 
EU697908, 

AM851091, 
AY771590, 
AY535660, 
EFl 58043, 
EFl 16594, 

EU735534, 
EU735339, 
FJ358521, 

DQ,358802, 

H(;i385479, 



AF075703, 
AF377956, 
AF190127, 
GU237072, 
, U54771, 
AF049337, 
AY227107, 
EF368370, 
AY093605, 
AF492623, 
DQ845388, 
AY945736, 
AF377959, 
AY894994, 
AY371159, 
AY900575, 
FM877780, 
DQP85872, 
DQP85871, 
AB547464, 
EF158040, 
AF377957, 
EU735536, 
EU170155, 
FN392874, 
HM026456, 
HQ385478, 



L20587, L20571, AY169812, AJ302647, AY532635, AJ006022, 
AJ271370, HQ,1 79987, GUI 11555, U42720, DQ373066, 
AF103818]. Pol, gag and env genes have been shown to be 
reliable regions for HIV-1 subtyping. [8,17-19] 

Nucleotide sequences from each gene were aligned with the 
reference sequences using CLUSTAL W software. Phylogenetic 
trees were constructed using Neighbor-Joining (NJ) and Maximum 
Parsimony methods (1000 times bootstrap replicates) with 
Molecular Evolutionary Genetics Analysis (MEGA) software 
version 5. [20] The Kimura 2-parameter model was used with a 
transition/transversion ratio of 1.5 and statistical support of the 
tree structures was obtained with 1000 bootstrap rephcates. 
Significance was based on bootstrap values of >70%. [20] 



Table 1. HIV-1 subtype classification by gene region of HIV-1 Iranian sequences from the HIV databases (www.hiv.lanl.gov). 





HIV-1 Genes 


Subtype Number (%) 


pol 


env 


gag 


Full-length genome 




CRF_35AD 144 (82.8%) 


CRF_35AD 100 (62.1%) 


CRF_35AD 101 (77.7%) 


CRF_35AD 9 (90.0%) 




CRF_AE 2 (1.1%) 


CRF_AE 1 (0.6%) 




AE 1 (10.0%) 


A1 2 (1.5%) 




B 25 (14.4%) 


B 54 (33.5%) 


B 27 (20.7%) 




CRF_29BF 2 (1.1%) - - - 




C 1 (0.6%) 


C 6 (3.7%) 






Total 


174 


161 


130 


10 



doi:l 0.1 371 /journal.pone.Ol 05098.t001 
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35-AD.AF.2007.169H.GQ477446 

35-AD.IR.2010.10IR.THR09F.AB703608 

35-AD.IR.2011.11IR.KSH24F.AB703609 

35-AD.IR.2011.11IR.KSH29F.AB703611 

35-AD.IR.2011.11IRSYZ33F.AB703613 

35-AD.AF.2005.05AF104.EF158042 

35-AD.IR,2011.11IR.SYZ27F.AB703610 

35-AD.AF.2005.05AF095.EF158CI41 

35-AD.IR,2011.11IR.SYZ33F.AB703614 

35-AD.AF.2005.05AF026.EF158043 

35-AD. A F. 2006.047H. GQ477443 

35-AD.IR,2010.10IR.THR41F.AB703615 

35-AD.AF.2007.273H.GQ477448 

35-AD.AF.2005.05AF094.EF15804D 

35-AD.IR.2011.11IR.KSH31F.AB703612 

35-A D. A F. 2006.073H. GQ477445 

35-AD. A F. 2006.077H. GQ477442 

35-AD.IR.2010.10IR.THR01F.AB703607 

35-AD. A F. 2006. 007H. GQ477450 

35-AD. AF. 2007.221 H.GQ477449 

35-A D. A F. 2006.027H. GQ47745 1 

35-A D.AF. 2006-051 H.GQ477444 

A1.SE.1994.SE7253.AF059670 

A1.UG.1992.92UG037.U51190 

A1.UG.1998.98UG57136.AF484509 

A1.KE. 1994.023-17. AF004aa5 

A1.KE.1997.ML752.AY322193 

A1.UG.1985.U455-U455A.M62320 

A2.CD.19g7.97CDK7B48.AF286238 

A2.CY.1994.94CY017^1.AF286237 

02 AG.EC.-.ECU42.AY151002 

45 cpx.CD.1997.97CD MBS30.FN392875 

G.KE.1993.HHa793-1-1.AF061640 

G.KE.1993-HHa793-12-1.AF061641 

G . B E . 1 996. DRCB L AF034936 

G . NG . 1 992. 92 NG083-JV 1 0832. U88a26 

G.SE.1993.SE6155-G6165.AF061642 

C.BW.1996.96BW0502.AF1 10967 

G. IN. 1 995.95IN21 068. AF0671 55 

C.ZA.2004.04ZASK146.AY772699 

C.BR. 1992. BR025-d.U52953 

C.ET. 1986. ETH2220. U4601 6 

C. BW.1996.96BW17.AF1 10980 
H-BE.1993.VI991.AF190127 

BE.1993.VI997.AF190128 
CF.ig90.056.AF005496 
J.SE-1993.SE9280-7887.AF082394 
J.SE.1994.SE9173-7022.AF082395 
F 1 . BE . 1 993. V 1850. AF077335 
F1.FI.1993.FIN9363.AF075703 
F1.BR.1993.93BR020-1.AF005494 
F1 . FR. 1 996.96FR-MP41 1 .A J249238 
F2.CM.1995.95CM-MP255.AJ249236 
F2. CM . 1 995. 95CM-M P257. A J24g237 
F2 . CM . 1 997. CM 53657 . AF377956 
F2.CM.2002.02CM-0016BBY.AY371158 
K-CD. 1997.97ZR-EQTB11 .AJ249235 
CM.1996.96CM-MP535.AJ249239 
D-CD.1933.ELI.K03454 

D. CD.1933.NDK.M27323 
D.CD.19a4.84ZR085.U88a22 
D.CM.2001.01CM^412HALAY371157 
D.TZ2001.A280.AY253311 

UG.1994-94UG114.U88824 
B . NL . 2000. 671 -00T36. AY423387 
B.US.1933.RF-HAT3.M17451 
B. US. 1998.1053-1 1.AY331295 
B.US.1990.WEAU160-GHOSH.U21135 
B.TH.1990.BK132.AY173951 
B.FR.1983.HXB2-1_AI-IIIB-BRU.K03455 
B.US.1936.JRFL-JR-FL.U63632 



Figure 1. Un-rooted phylogenetic tree of CRF_AD whole 
genome sequences from Iran and Afghanistan, constructed 
using Kimura 2-parameter matrices and Neighbor-Joining 
method. 

doi:10.1371/journal.pone.0105098.g001 
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To confirm the results obtained using MEGA5, sequences were 
re-analyzed using REGA. To improve the accuracy of the 
characterization of recombinant forms, Maximum Likelihood 
and NJ trees were re-constructed using RDP v.4.35 software. 
Results from all the different analyses were compared to determine 
the final subtype characterizations. 

Results 

A total of 475 Iranian HIV-1 sequences were analyzed in this 
study, of which 174 sequences were ior pol genes, 161 sequences 
for env genes, 130 sequences ior gag genes, and 10 sequences for 
full-length genomes. Overall, CRF_35AD was the predominant 
subtype representing 78% of sequences. By region, CRF_35AD 
comprised 83% of HIV- 1 pol sequences, 62% oi env sequences, 
78% oi gag sequences, and 90% of full-length genome sequences 
analyzed. 

Table 1 presents the distribution of subtypes in the present 
study. There were 240 sequences re-categorized as CRF_AD and 
2 sequences re-categorized as CRF_29BF. The proportion of 
CRF_35AD sequences categorized by the current study is nearly 
double the proportion of what had been reported in the HIV 
databases. Of the HIV-1 pol gene sequences analyzed, 69 Al 
sequences (39.6%) in the Baesi and Hamkar studies and 
55 CRF_AD sequences (31.6%) in the SoheUi and Hamkar 
studies were re-categorized as CRF_35AD in present study and 2 
sequences (1.1%) which had been reported as subtype B in the 
Hamkar study were re-categorized as subtype CRF_29BF. 
[10,14,16] Of the HIV-1 env gene sequences, 15 sequences 
(9.3%) which had been reported as subtype Al in the Bahmani 
and Khosravi studies were re-classified as subtype CRF_35AD. 
[20] Of the HIV-1 g'ag'gene sequences, 101 Al sequences (77.7%) 
in the Naderi and Sarami studies were re-categorized as subtype 
CRF35_AD. [11,12] 

A phylogenetic tree of HIV whole genome sequences from Iran 
and Afghanistan is shown in Figure 1. The 13 Afghani isolates 
were from the mid- to late 2000's whereas the 9 Iranian isolates 
were from the early 2010's. In the three pairs that were comprised 
of an isolate from Afghanistan clustering with an isolate from Iran, 
the support values were low. 

Discussion 

In this phylogenetic analysis of sequences in the Iranian 
population, HIV-1 subtype CRF_35AD was found to be the 
dominant circulating strain. The result of this analysis differed 
from previous studies which reported subtype A as the most 
prevalent in HIV- infected patients in Iran but confirmed the 
results of other studies which reported the predominance of HIV- 
1 CRF_35AD among Iranian IDUs. The phylogenetic analysis 
also identified 2 cases of CRF_29BF. The observed differences 
between our findings and those of previous studies may be due to 
the unavailability of HIV reference sequences for certain subtypes, 
e.g., CRF_29BF, in the HIV databases in previous years or 
ahgnments performed using reference sequences that were not 
representative of all subtypes in Iran. 

The identification of CRF_35AD and CRF_29BF strains 
circulating in Iran are likely the result of the importation of these 
strains from other countries. The observed epidemiological 
connection between HIV strains circulating in Iran and Afghani- 
stan may be due to drug trafficking and/ or immigration between 
these two countries. Iran is a major route for drug trafficking 
between Afghanistan and Europe. In addition, Iran has received a 
large number of Afghan refugees. Since CRF_35AD is also the 
dominant strain among Afghan IDUs, it is possible that the 
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observed expansion of CRF_35AD is due in part to Afghan IDUs 
who immigrated to Iran. [6,21,22] CRF_29 strains in Iran may 
have originated from South America where this subtype is more 
commonly found. 

Our findings suggest the possible origins and transmission 
dynamics of HIV/ AIDS within Iran. Knowing the distribution of 
HIV variants alongside the corresponding epidemiologic factors 
will help assess the implications of any differences in transmissi- 
bility. The public health implications of such findings, including 
prevention and treatment strategies, are of special interest. 
According to the latest report from CDC, injection drug use 
remains the primary transmission route of HIV infection in the 
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